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ABSTRACT 


The purpose of this research was to compare a two-di- 
mensional polygon discrimination learning (DL) test designed 
by Arima and Young with the Armed Services Vocational Aptitude 
Battery (ASVAB) currently being used for military selection. 

A sample of 65 high school students who had taken the 
ASVAB in the military's High School Testing Program were 
given the DL test during the first two months of 1979. Little 
correlation was found between the two tests. Multiple re- 
gression with IPR as the dependent variable indicated that 
the subtest General Information (GI) predicted Information 
Processing Rate (IPR) score to some extent. For whites only, 
the Armed Forces Qualification Test (AFQT) also served to 
predict IPR score. Results indicate that the DL test measures 
some other aspect of ability other than that demonstrated 
on the ASVAB and may therefore be of use in recruit 


screening. 
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1. INTRODUCTION 


The advent of the all-volunteer force in 1973 dictated 
many changes in personnel management procedures and conditions 
vithin the military. The largest, most immediate effect vas 
that there no longer existed a massive pool from vhich to 
pick individuals meeting high, rather arbitrarily defined 
standards of mental and physical ability. Ina very short 
time, the military services have had to shift from the luxury 
of administratively selecting from an unlimited source to 
the requirement to first create a source through more pro- 
nounced recruiting and then to select a slightly smaller 
number of entrants from this much smaller pool. 

The basic personnel procurement problem facing the military 
is that it has been able only with great effort to meet the 
numerical requirements for new recruits. The present Chief 
of Naval Operations, Admiral Hayward, recently estimated the 
petty officer shortage in the Navy to be reaching 30,000. He 
also stated that this shortfali could possibly result in 
unfilled committments (Sinaiko, 1978). There are myriad 
reasons offered for manning shortfalls ranging from the pass- 
ing of the baby boom to the increasing unattractiveness of 
military vork (Cooper, 1978, Defense Manpover Commission, 
1976). 

There are numerous ways which present and projected short- 


falls can be combatted. Retaining more present members of 





the armed services is perhaps one of the best as vell as 
one of the most difficult. Alloving more vomen into the 
service is another obvious means. Mental, physical, and 
moral guidelines could be relaxed and immediately alleviate 
recruiting problems. There is an ever-increasing list of 
alternatives for manning the armed forces, each with a 
mc. and each with a price. 

There is one alternative, however, wnich is less generally 
considered. That is to improve our present selection pro- 
cedures by devising more effective mental-testing programs. 
The purpose of the following research was to study such a 
possible improvement in the services' enlistment testing. 

Traditionally the measure of an individual's mental 
ability has been taken through the use of various paper-and- 
pencil, verbally oriented tests. The present forms of the 
Armed Services Vocational Aptitude Battery (ASVAB) is one such 
paper-and-pencil test. This battery, when used alone to 
screen applicants for mental ability, suffers by its design 
120 tions. Estes (1974) points out that an individual's 
successful passing of a verbal-type test provides useful in- 
formation as to the extent to which the various prerequisites 
for successful performance have been simultaneously satisfied 
by the individual's combination of inherent capacities and 
past experiences. But failure gives little information 
because there is no indication as to why the individual failed. 


There are, unfortunately, many more reasons other than limited 
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Individuals can possess hixh mental capability and still 
do poorly vhen tested in the traditional manner if they have 
not received the background and training necessary (and 
assumed by the examiner). Words used in the test may be 
strange; the subject matter may be foreign; the general for- 
mat of the test may be new; an individual may give an answer 
which is meaningful in his own frame of reference but not 
within that of the examiner. Two people with equal mental 
potential can make impressively different scores on a test 
of mental ability if only one has been "taught" to take that 
type of test. Thus present tests would seem to be unfair to 
the bright but unorthodox person, to the culturally disad- 
vantaged, and to the naive individual who lacks experience 
in taking standardized tests (Holtzman, 1971). 

Since the ASVAB is one such standardized test, and since 
it is the sole means of identifying those "mentally qualified" 
for duty, it is quite conceivable that the armed forces are 
turning away a substantial group of otherwise qualified young 
men and women simply because of the format of the test. 

Since, also, the culturally disadvantaged and the inexperienced 
test takers tend to be nonwhite in this country, equal oppor- 
tunity requirements are perhaps not being completely satisfied. 

The Equal Employment Opportunity Commission (EEOC), es- 
tablished to enforce the Civil Rights Act of 1964, has looked 
fairly critically at employment testing in recent years. The 


important relevant point in the commission's guidelines is 
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that employment tests should be used only vhen there is 
demonstrated evidence of their validity for the purpose, and 
that tests must not discriminate unfairly against minority 
groups (Federal Register, Vol. 36, 1971). Much progress in 
both mental measurement and validation procedures must be 
realized before these guidelines can be satisfied. Minori- 
ties still score lower on service entrance examinations than 
whites although it has yet to be show that nonwhi tes perform 
less well on the job (Lockman, 1976; Bilinski, 1974). It 
would be well for the services to resolve this incongruity of 
their own volition since there will probably be pressure from 
outside to do so in the future. 

The research presented here was undertaken to study one 
alternative means of testing mental capabilities of potential 
recruits. The test was originally designed by Arima (1978) 
as a test of learning ability. It is a culture-fair or 
culture-free approach in that an attempt has been made to 
limit penalties caused by different cultural backgrounds. 

It was anticipated that a test similar to this culture-fair 
test could be used to supplement the present ASVAB causing 
the new total test battery to be less discriminatory against 


culturally different backgrounds. 
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II. PREVIOUS CULTURE-FREE TEST ATTEMPTS 


There have been numerous attempts at devising culture- 
free devices for testing individuals and predicting various 
qualities or outcomes. Although the approaches have been 
quite different, the goal has been similar: to arrive at some 
measure of a person's abilities in such a way as to not allow 
that person's cultural situation to be a deciding factor in 
the outcome. 

The Porteus Maze Test and the Navy Maze Test (Cory, 1971) 
attempted to measure learning ability, especially in mental 
group IV personnel. The purpose was to help select a portion 
of those who had failed standard service tests to be admitted 
into the military. The testee had to trace his way out of or 
into different mazes and was graded on various procedures 
and responses as well as successful completion. 

Glickman, et al. (1971) examined many tests attempting 
culture-fair prediction of performance in such widely varying 
groups as taxi drivers, college students, insurance salesmen, 
and sewing machine workers. Training simulators, psychomotor 
tests, opinion/personality/interest questionnaires, and work 
sampling were investigated. Unfortunately, it was found that 
even these less traditional devices had either low predictive 
validity or were biased toward racial groups. 

Siegel and Leahy (1974) describe an attempt to predict 


performance in the fleet by a type of job sampling. They 
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vere interested in measuring learning ability on small tasks 
related to the machinists mate rating. Their work produced 
quite encouraging results, however the technique would prove 
hard to use with "mass production" selection processes. The 
time required to screen one individual would be prohibitive. 
Of significance is the fact that they proved that a test of 
learning ability can be used effectively as a nonbiased pre- 
dictor of job performance. 

Cory, et al. (1973) reported an interesting attempt at 
testing recruits undergoing basic training. The recruits 
were tested on their ability to perform tasks they had recent- 
ly learned. The test scores were then correlated with GCT, 
ARI, and AFQT scores. The experiment was discontinued because 
it vas determined that it vas not possible to get enough of 
a distribution in scores on the task performance test. What 
was of interest was that there was low correlation between 
the Recruit Training Test and the more standard GCT/ARI and 
AFQT, suggesting that the two types of tests measured dif- 
ferent abilities. 

Tests which measure ability to learn such as those of 
Siegel and Leahy and Cory seem to hold the most promise.  Un- 
fortunately they suffer from a time and equipment intensiveness 
which limits efficient use on a large scale. What is needed 
is a measure of learning ability which could be taken ina 
short period of time to be compatible with present Armed Forces 
Examining and Entrance Station (AFEES) procedures and time 


constraints. 
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III. THE ARIMA-YOUNG TEST 


Young (1975) and Arima (1978) described a nonverbal dis- 


1 they constructed and admin- 


crimination learning (DL) test 
istered to a group of recruits at the Naval Training Center 
(NTC), San Diego, California. It was decided that a non- 
verbal format would reduce the effect of previous educational 
experiences and cultural background. The specific shapes came 
from work done by Arnoult (1954, 1956) and are shown in 

Figure 1. These shapes were formed from computer generated 
random numbers indicating points which were then connected by 
lines. Shapes of this type were preferable to profiles of 
objects because of the culture-free goal. 

Discrimination learning (DL) was chosen as the format 
because of its possible potential for measuring general 
learning ability as opposed to some more specific ability as 
would be measured in the more common employment testing pat- 
terns. Work sampling, questionnaires, psychomotor tests and 
the like could be expected to have somewhat higher validity 
within one specific occupation. What is needed, though, is 
a test which could measure a more universal quality. The 


armed forces have too many, quite different, job requirements 


E is, simply, learning to discriminate a "correct" choice 
from a group of two or more items. It is usually dore by re- 
peated trials and errors with some reinforcement procedure 
used when the desired choice is made. As more and more 
"correct" choices are made, learning can be shown to be 
taking place. 
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Figure 1. Shapes selected for use in assembling 
stimulus lists. 


(Frøm Arnoult, 1956) 
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to benefit from excessively specific employment aptitude 
tests for selection purposes, 

The DL test has a final qualification vhich is useful in 
the end requirement of screening potential recruits for the 
armed forces, A DL type of test can be BEE et, Ze a 
very short period of time. This characteristic becomes quite 
important when one considers that prospective recruits have 
already a fairly full schedule of mental and physical exam- 
inations during their one day at the AFEES (Navy Recruiting 
Manual, 1978). 

The procedures used for test construction and preliminary 
experiments are well described by Young (1975) and Arima (1978) 
and will not be detailed here. The tests they devised were 
administered to 160 male U.S. Navy recruits at NTC, San 
Diego in order to evaluate the characteristics of the test 
under conditions as close to operational as possible and to 
investigate the appropriateness of the various test parameters 
such as scoring technique, composition, and presentation 
methods. Among the results attained were the following: 
(Young, 1975; Arima, 1978) 

1. Learning did take place during the test. 
2. A self-paced mode of test administration appeared 
to give more useful information than a machine-paced 
| mode. It did not restrict a superior performer. 


3. No significant difference was seen between white and 
nonwhite performance on the self-paced test. 
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4. Performance did not correlate highly with performance 
on standard verbal-oriented intelligence tests (at 
270500 time, the General Classification Test (GCT). 

The reSearch which follows was deSigned to begin where 


this previous work left off. 
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IV. EXPERIMENTAL GOAL 


In the time since recruits from the Naval Training Center 
were administered the Arima-Young test there has been a major 
change in the test batteries used for selection. At that 
time potential enlistees were given the Armed Forces Quali- 
fication Test (AFQT) and some sort of supplemental test bat- 
tery which varied in design and use from service to service. 

Since then, all services have adopted a uniform testing 
program for selection and classification of most new recruits. 
Today each volunteer is given the Armed Services Vocational 
Aptitude Battery (ASVAB) which consists of twelve subtests, 
described in Table 1. Some special selection tests are still 
administered for speciality areas -- nuclear power ratings in 
the Navy, for example. Scores from three of the ASVAB sub- 
tests -- Work Knowledge, Arithmetic Reasoning, and Space 
Perception -- are combined into a general scale of mental 
ability. This combined score is known as the AFQT, a name 
held over from when there was a dedicated test by that name 
as mentioned above. The new AFQT score is used to separate 
volunteers into various mental groups, and it is the primary 
means of determining mental eligibility for military service. 
Scores from all subtests are used for classification purposes 
once basic mental eligibility has been established. 

The purpose of the research described here was to compare 


the Arima-Young test with the present ASVAB in an attempt to 
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determine if their approach to culture-free testing measured 
some different capability than that measured by the ASVAB. 

In order for this new type of test to be useful, it must be 
capable of identifying some new group of individuals who have 


the potential to succeed in military service. 
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Name 


(GI) 
(NO) . 
(AD) 
(WK) 
(AR) 
(SP) 
(MK) 
(EI) 
(MC) 
(GS) 
(SI) 
(AI) 


TABLE 1 


Subtests of the ASVAB Form 5 


of Test 


General Information 
Numerical Operations 
Attention to Detail 
Word Knowledge 
Arithmetic Reasoning 
Space Perception 
Mathematical Knowledge 
Electronic Information 
Mechanical Comprehension 
General Science 

Sep Information 


Automotive Information 


2 


Number 
of Items 


15 
50 
30 
30 
20 
20 
20 
30 
20 
20 
20 
20 


Testing Time 
in Minutes 


07 
03 
05 
10 
20 
11% 
20 
15 
15 
10 
08 


10 





V.MEPEST DESCRIPTION 


A. TEST CONSTRUCTİON 

Six pairs of figures were used, selected from those shown 
in Figure 1. They were the same as Arima and Young's Stimulus 
List 1 which was used in the self-paced phase of their experi- 
ment. This list contained figures previously determined to 
have the least similarity between figures in a pair and also 
the least similarity between pairs (Young, 1975). Figure 2 
depicts the figures used. 

One figure from each pair was choosen to be the correct 
response. The role of a die was used to make this assignment 
which resulted in different correct figures than on the Arima- 
Young test. Order of presentation of the pairs was the same 
for both tests. 

The test was designed so that one pair of figures at a 
time was revealed. The subject chose which figures he or she 
felt was the right response (guessing at first) by pressing 
a clear panel covering that figure. If the correct choice 
was made, the test apparatus would cycle to the next pair. 

If the choice was incorrect, nothing would happen until the 
Nec» chose the other alternative. The above correctional 
procedure was the only reinforcement used. The six pairs 
were each shown ten times in random order for a total of 60 
frames. Table 2 shows the order of pair presentation. The 


entire test was self-paced. 
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Figure 2. Test Figures 
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TABLE 2 


Order of Stimulus Set Presentation 


Figure Order 


Repetition 


2; 1; 5; 6; 3; 4 


1; 2; 6; 4; 3; 5 


6; 


4 
5 


505 
2507210 O; 1 


2. 


IN; 


6; 4; 


7 


z 


6; 4; 3; 1 
6; 
ee 534 


6; 


z: 


DE 


771 1 


5 


DI 


4; 


1; 2; 4; 5; 3 
6 


D: 


55: 


E 


25 


10 


Figure numbers refer to those shown in Figure 1. 


Note. 
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Other constraints used in the construction of the test 
were: 

i. The same two figures were always displayed together. 

2. The left-right order was changed randomly with the con- 
straint that each figure appeared on the left and right 
an equal number of times. 

. Order within each series of six was varled. 


E 
4. All six pairs were presented before one was repeated. 
5 The same pair was never presented back-to-back. 

6 


. All figures retained the same "upright" orientation -- 
l.e., they were never turned around. 
B. TEST HARDWARE 

The equipment used, shown in Figure 3, was capable of 
presenting one pair of figures at a time to the subject, de- 
termining correct and incorrect responses, providing total 
figure exposure time, and counting total correct and incorrect 
responses. All equipment was manufactured by Behavior Con- 

25 institute (BCI). 

The system centered around the BCI Stimulus Response 
Programmer. The programmer was set to operate in the multiple 
choice mode which was capable of displaying four choices 
under four clear panels (channels A, B, C, and D) on the top 
of the machine. A choice was made by depressing one of these 
panels which caused various electrical circuits to be completed 
depending on how the machine was programmed. Since only two 
choices were possible in the present test, the panels corres- 


ponding to channels A and D were deactivated and blocked. 
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Machine programming was accomplished with the use of the 
same block of fan-folded paper that displayed each pair of 
figures. As the paper cycled through the machine exposing 
one pair at a time and stopping until correct selection was 
made, the machine "read" a code punched in one side of the 
paper which indicated the correct response and initiated 
movement. The machine stopped automatically at the end of 
the test. 

A BCI Four Choice Auxiliary Control Console acted as an 
interface between the programmer and a BCI Scoring Indicator 
which cumulatively tallied correct and incorrect responses. 

A timer was added to the system and mounted on the Four Choice 
Auxiliary Control Console and electrically interfaced with it 
in such a way that the timer ran only during actual figure 
exposure time. It would stop when the paper was cycling to 

a new palr. 

An electronic counter was added, built into the rear of 
the Scoring Indicator. It was designed to activate a buzzer 
when six correct responses in a row were made by a subject. 


It became unreliable during the experiment and was not used. 


C. SCORING 

Scoring was done by calculating an Information Processing 
Rate (IPR) for each subject. Each frame was considered to 
BEL tain one bit of” information: the correct figure of å 


displayed pair. A subject processed one bit of information 
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vhen he or she picked the correct figure. Therefore a rate 
of information processing could be computed by dividing total 
correct responses by total exposure time, both of vhich could 
be read directly from the test apparatus. IPR results dis- 
cussed and displayed have been multiplied by 1000 in order to 
allow easier manipulation and discussion. 

A second means of scoring had initially been planned. It 
was hypothesized that another way to measure performance was 
to record how many frames were presented before a subject 
scored six correct responses in a row, indicating he or she 
had "learned" all six pieces of information -- a trials-to- 
criterion measure. The equipment used to signal when six 
correct in a row was achieved malfunctioned early in the ad- 
ministration of the test to subjects, however, and this second- 
ary method of scoring could not be used. It was discovered 
while this scoring system still worked that most subjects 
still missed many responses subsequent to scoring six in 
a row right. Thus it appears that a simple count of the 
number of frames required to score six in a row may have been 
an inaccurate measure of learning. 

It is apparent that no real information processing took 
place for the first six frames since the subject had never 
Seen the pairs before and could only guess which figure in 
each pair was correct. It was felt that no advantage was 


given to any particular individual because of this fact, and 
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in the interest of administration and scoring simplicity re- 
sults of the first six frames vere not and could not be 
separated from the results on the next 54 frames. All scores 
would have been slightly higher had only the last 54 frames 


been used to compute IPR. 
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VI. TEST ADMINISTRATION 


A. SUBJECTS 

In order to most accurately compare ASVAB with the Arima- 
Young test it was necessary to locate subjects who could take 
both tests prior to any selection process for military service. 
Only in that way would the lower spectrum of ability (as 
measured by the ASVAB) be represented for each test. For 
that reason it was not possible to administer the Arima-Young 
test at Recruit Training Centers. Only subjects who had al- 
ready scored sufficiently high on the AFQT would have been 
represented. Even volunteers tested at the AFEESs would have 
been preselected to some extent and would have presented a 
biased sample. 

As part of its recruiting effort the military administers 
a High School Testing Program. A form of the ASVAB is given 
to high school students as a counseling device for school 
counselors. The test is available to all students. While 
no direct recruiting takes place during the testing process, 
the test results are later made available to recruiters, and 
the entire program has proven to be a valuable recruiting 
tool (Navy Recruiting Manual, 1978). It was this group of 
high school students which seemed to offer the best medium for 
comparison of the two tests. 

Three schools on the Monterey Peninsula cooperated in 


allowing this experiment. Pacific Grove High School, Seaside 
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High School, and Alisal High School in Salinas made available 
a total of 65 students vho had taken the ASVAB Form 5 vithin 
the past four months. Table 3 describes the sample by race 
and sex, 

The nonvhites vere represented most heavily by individu- 
als of Spanish descent (11 total). Only one black took the 
test, The rest of the nonwhite group of students is divided 
as follows: Filipino descent (2); Oriental descent (4); 
Native American descent (1); "Other" backgrounds (3). Racial 
group was determined by self report by each subject. 

Students came from grades 9 through 12 with the average 
grade being 10.7. Ages represented were 14 through 18, 


averaging 16.2. 


B. PROCEDURES 

| Subjects were initially given the test instructions in 
groups of four or less. Instructions were read to them con- 
cerning the type of test they would take, what was required, 
how they could determine a correct answer, and other character- 
istics of the test. The test instructions appear in Appendix 
A. Subjects were then shown an example cf what they would 
encounter in the form of a sample test consisting of only two 
pairs. The figures used were from Arnoult's (1956) collection 
shown in Figure 1 and were not the same ones used in the 
actual test. The sample test served to acquaint the students 


with the mechanics of the test apparatus and to help illustrate 
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TABLE 3 


Summary of Student Participation 


SUBJECT SCHOOL TOTAL 
GROUP Seaside Pacific Grove Alisal SUBJECTS 
WHITE (167) (28) (2) (43) 
Male iz 10 0 17% 
Female 10 14 2 26%% 
NONWHITE (5) (10) (7) 22) 
Male 2 7 2 11“ 
Female 3 D 5 LES 
ALL SUBJECTS 22 34 9 65 


*Total males - 28 
**Total females - 37 


DI 





the verbal instructions. Any final questions the group had 
were answered during and after the sample test. 

Individuals took the actual test alone. Last minute 
questions were answered, and the test was started and contin- 
ued until all 10 repetitions were presented. The total 
figure exposure time ranged from 35.5 sec. to 161.1 sec. 

The mean exposure time was 79.1 seconds. Including a 1.4 
second cycle time between pairs, the entire test lasted 


2.7 minutes on the average. 
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. VII. RESULTS 


Raw ASVAB scores were obtained for each subject from the 
schools involved in order to carry out the required analysis. 
A summary of scores (by mean and standard deviation) is pro- 
vided in Table ^ for all subjects and followed by Tables 5 
through 7 for each school. For each figure the number (N) in 
each category is followed by the subtest scores, the composite 
AFQT score and, finally, the score (IPR) from the Arima-Young 
test. A summary of individual IPR and AFQT scores by ethnic 
group is presented in Appendix B. 

Zero-order, product-moment correlation coefficients were 
computed in order to measure the strength of the relationship 
between IPR and ASVAB scores. The results are shown in 
Table 8. The highest subtest correlation can be seen from 
Table 8 to be with the General Information (GI) subtest with 
a correlation coefficient of .34. A low but positive cor- 
relation statistically significant at the .05 level was also 
found for the AFQT. Figure 4 is a scattergram of individual 
IPR scores plotted against the AFQT score. 

Subjects were then divided into two groups depending on 
the AFQT score. Those who scored in the top 50% nationally 
were Separated from those who scored in the lower 50%. Again 
Pearson product-moment correlations were computed to determine 
if one group or the other had closer relations between the IPR 


score and the ASVAB and AFQT scores. The results are shown in 
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TAB 


LE 4 


MEAN TEST SCORES BY RACE AND SEX - ALL SCHOOLS 


male 


a 


10.09 
2.95 


17.36 
7.49 
10.64 
4.43 
8.91 
2.3 
31.91 
SE S 
10.64 
4.23 
14.64 
4.52 
11.64 
375: 
14.64 
3.33 
8.46 
3.39 
9.82 
2.68 
8.91 
2:01 


36 . 46 
12.45 


NONWHITE 


il 


6 
2 


дор 


6 
bi 


5. 
бу. 
an 


36 
TI 


10 


3. 


12 


tə 


Nin PU Fo EU NO 
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12. 


.37 586.64 450 


WHITE 
male female total 
N i % 26 43 
GI 12.29 7.85 8.81 
1.69 1509 2.16 
WK 21:98 17.89 19.46 
5.08 5.60 5.69 
MK 14.47 13.15 13.67 
4.19 4.32 4.27 
GS 12.06 6577-10-16 
m 01 3.03 3.74 
NO £ 0 36.50 36.51 
7.98 8.05 7.92 
AR 13.47 11.96 12.56 
E 3.18 9.67 
FI NES 12.31 15.26 
5.87 eso 5.39 
ST 2,33 8.92 10.49 
2.77 225 27: 
AD 1570 14,73 14.33 
3.87 3.09 3.41 
SP 1277: 10.58 11.30 
5.41 3.69 4.48 
MC 11.94 055 9.77 
3.60 3.05 3.69 
AT Ҹә ZEN 8.02 
4.89 3.03 3.97 
17.22 bo ME 43,38 
APQT 11.36 9.73 10.89 
IPR 659.06 648.00 652 
248.80 265.64 256 


Eg ——>>>-—==2== == = 


SIO 170089152. 


Note. Top number is test mean. 
deviation. 


22 
. 46 8 
“2 3 
46 15. 
79 7. 
2 LI. 
41 ly, 
7 7. 
Du 2. 
.09 34. 
40 10 
25:10 
7 2 
5:1: 
72 3. 
. 91 9 
7:5 3D 
80, 13 
7:: 
. 00 8 
. 38 3 
. 82 7 
17 2 
02 7 
177 3 
46 34, 
DS do. 
0012 
52 175 


female total 


TOTAL 
male female total 
28 3m 65 
Se, 10.24 7.43 8.63 
„12 2.22 1.99 2.50 
DR 20511 16.57 18229 
26 6.41 6.23 6.50 
HB 18.58 12.73 12.83 
86 4.62 4.64 4.59 
82 10.82 8.27 9.37 
07 3.86 3.15 3.68 
307771) 39.38 35.66 
7: 8.84 9.00 8.90 
mae 12.35 11,38 11.60 
(67 4.39 3.8 93.79 
75 16024 12.06 14.008 
80 5.45 3.73 4.88 
oe 12.39 8.5 10.05 
. 86 p. 76 2.59 3.72 
75: 12.24 14.17 
20 3.63 3.69 3.63 
"JB 10596 10.1% 10.43 
. 83 5.05 3.91 4.42 
25 11:11 7.60 0.1 
77 3.38 2.86 3. 54 
36 9.18 6.75 7.80 
„00 4.16 2.86 3.66 
h6 13.32 38.05 40.32 
DEI 12.07 11.03 12.05 
.32 630.60 539.14 607.00 
.69 222.64 252.75 239.32 


Bottom number is test standard 
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TABLE 5 


MEAN TEST SCORES BY RACE AND SEX - SEASIDE HIGH SCHOOL 





WHITE NONWHITE TONAT 

.male female total male female total male female total 

N 7 10 did 2 3 5 2 13 22 
CT 9.29 T6 8.29 9.0 55 6.80 E EMI 7.96 
ES 1.78 T. 20 4.24 2.52 3.42 1.92 2510 2.26 
WK ми (16.68 19.59 20.00 11.33 14.80 23.00 15.39 18.50 
3.89 5.99 6.28 6.66 8.09 7.95 4.27 6.56 6.80 
MK 27771 11.50 12.82 11.00 77:10:00) 15.69 11.00 12.18 
li 2:76 3.64 4.24 6.11 4.90 4.20 3.58 4.02 
22 13.00 2750 10.41 8.00 2235 260 11.99 0321 0007 
75 4.20 4.37 4.24 9.22 3.13 3.95 3.90 4,23 
NO 36.43 36.90 36.71 36.50 37.00 36.80 36:44 36.92 36.73 
me 6.65 8.63 6.36 14.93 11.03 8.35 9.66 8.94 
AR 157) 11.10 12.06 9.50 9567 650 12056 10.77 11.50 
3.87 2.69 3.33 3.54 3.51 3.05 3.97 2.80 3.36 
ET 1589990 10,90 14.24 13.50 14.00 13.80 17.78 11.62 14.14 
708 5.22 Do 0771 2.65 1.92 Zap 4.34 — 6.63 
ST 3.71 7720 10.16 11.66 6.00 5500 17711 7 JET 9.68 
4.86 2001 4.60 | 2.65 3.39 £4.40 2.39 4.38 
13.43 13.40 13.41 14.00 12.33 13.00 13.56 13.15 13.32 
AD h.86 3.47 3.95 5.66 4,73 4.47 4.67 3.60 3.97 
Sp 15707 10:90 11.77 9.50 9.00 oon. 12,07 10.15 11.18 
5.91 755 4.66 0.71 65002 4.32 5.43 3.85 4.62 
MC 11,86 6.90 8.94 7.50 chio 6:0 10:89 6.69 8.41 
4.56 2005 9.71 12.00 1.14 4.40 baoi 13373 
AT 9.00 5570 2:06 10.00 25) ии О 9.22 5.62 7.09 
4.08 2.58 3.58 1.41 1.53 2.86 3.60 2.33 3.87 
AFQT EE 39.00 30.00 33.60 48.22 36.31 41.18 
11242 o 15.00 Duo 11:27 13:61 11.51 10.19 12.07 
IPR 479.00 660.00 585.47 428.50 348.67 380.60 467.78 588.15 538.91 
195.57 251.06 241.31 109.60 104.29 101.75 175.17 260.24 232.51 

Note. Top number is test mean. Bottom number is test standard 


deviation. 
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TĄBLE 6 


MEAN TEST SCORES BY RACE AND SEX - 
PACIFIC GROVE HIGH SCHOOL 





deviation. 


36 


NHITE NONWHITE TOMAT, 

male female total male female total male female total 

N 10 14 24 d 3 10 d 17 34 
GI 21.00 8.14 9.33 10.43 7.33 928 10.77 8.00 9.38 
1.56 1.9% 2526 3.10 2.52 DM 2.25 2.00 2.52 
WK MG c NUS 16021 17.39 18.30 19.77 18.71 19.24 
5.54 5.67 5.55 8.02 9.21 8.02 6.50 Beer ME 27 
MK 05 152559 10.57 11.00 10.70 12.77 14.18 13.47 
4.45 4.64 4.47 6.26 6.93 5.40 5.00 5308 5,02 
as 11.40 0500 10.00 9.43 8.33 9.10 10.59 8.88 9.74 
4.45 2.25 3.48 3.05 Др 3.38 3.95 2.64 3.42 
NO £ 50 35.50 95.96 28.57 27.00 28.10 33.29 34.00 33.65 
7.46 7.52 7.35 9.93 11.79 9.86 651 3.03 8.80 
AR PP” —”?QPÇ—““V. 0001 10:00 10.67 10.20 12.06 12.35 12.21 
4.70 3.43 3.93 4.80 4.04 4.37 4.92 3.50 4.21 
OO 1 AB 14.87 15.00 14.00 14.70 15.59 13.77 14.68 
3.68 2.79 275 4.40 1573 2:71 3.89 2.56 3.37 
25:50 10:21 11.08 12.00 250 102770 12.18 0522 10797 
2.95 2.16 2807 "30 3022 7 3.66 2.46 3.31 
AD 12:75) 1170 11:66 14.00 14.60 14.29 15.18 14.71 
əə 2.41 2.84 2.97 6.25 3. 84 "aO 3.15 3.10 
Sp 1770 11.00 11.25 8.86 9.00 8.90 10.47 10.65 10.56 
5.19 4.17 4.52 3.67 4.58 3.70 Dee 4.17 4.38 
MC 12.00 075 10046 10:57 6.06 9.20 11.41 8.77 10:09 
3.02 3.34 3.41 2.94 1.00 3.29 2.98 3.31 3.38 
AT 9.00 0577 o. oft: 8.71 7.33 8.30 Os 8214 8.32 
2060 777 7 -4.08 3.04 2.89 2.91 4.60 2.72 3.75 
AFQT 777(7)7COU(UC 770770” 37200 37.40 42.29 41.71 42.00 
ви RA ER IE AL 17952. 15.58 13.16 12.00 12.51 
IPR 7770027177 0700 653.14 LS7.67 587.50 726.65 617.06 671.85 
204.15 295.96 265.26 160.94 99.71 165.85 195.82 279.63 244.12 

Note. Top number is test mean. Bottom number is test standard 





TABLE 7 


MEAN TEST SCORES BY RACE AND SEX - ALISAL HIGH SCHOOL 





WHITE NONWHITE TOTAL 
male female total male female total male female total 
N 0 2 2 2 5 7) 2 7 J 
GT 7.00 2000. 10200 6.60 2— 40.00 Or Al 7.44 
1.41 1.41 — 1.82 2551 220) 1.60 2.24 
WK ии ии 10.00 12.40 11.71 10.00 13.57 12.78 
2m 2-12 250: 4.39 3.95 2569 4.20 4.09 
YK 9.50 7:7:577:17:7 7 60 12.71 10.50 12.13 12.00 
4.95 4.95 3. 54 TOI 4,31 3.54 Ди 4.36 
cs MG io oo ESOO 5.40 6.14% 8.00 6.71 7.00 
Lil Li 0 52 le? 0 2.63 2.35 
NO 41.50 41.50 39.00 41.00 240.43 39.00 41.14 40.67 
EE 7.1% 7.16 9.90 7.62 7.53 
AR 1700 11.00 11:00 2270711 00 14.00. 10.14 11.00 
1525 4,2: 0 25500 7 0 5387000 
FI 9.50 9.50 14.50 11.40 12.29 14.50 10.86 11.67 
3.54 3.54 9.19 2.21 4,82 9,19 3013 4,53 
SI 2500 510011200 20 6515 11.00 6271 2.07 
0 0 MAM 1.41 2.34 1571 MSZE 2.24 
a 271770 50 is so 15.00 13.53 14.50 14.00 14.11 
4.95 4.95 4.95 5.05 4,65 4.95 4.90 4.60 
SP 8.00 8.00 6.00 9.00 8.14 6.00 55: SO 
0 0 4.24 4,36 4.22 4.24 3.59 3.66 
MC 8. 50 8.50 9,50 5.60 60 9.50 6.43 (əbi 
3.54 3.54 2212 SZ 2.43 2.12 2577 25: 
AT 4.00 4.00 8.50 5.20 6.14 8.50 4.86 5.67 
0 0 117707) 9:30 4.95 1.95 2.92 
35.50 35.50 30.00 91.20 30.86 30.00 32.43 31.89 
b. 6.36 76.36 “7.07 78.35 7.43 77.07 77.59 7.11 
IPR 565.50 565.50 547.00 506.20 517.86 547.00 523.14 528.44 
207.18 207.18 264.46 189.86 189.97 264.46 178.96 181.31 
Note. Top number is test mean. Bottom number is test standard 


deviation. 
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Table 9. Both groups had similar nonsignificant correlations 
for the AFQT, and the only significant correlations occurred 
on the General Information (GI) and General Science (GS) sub- 
tests for the lower 50% group. The relative absence of sig- 
nificañt correlations is due, in part, to the smaller sample 
sizes. 

The sample was also divided to determine how the IPR of 
whites and nonwhites correlated separately with the ASVAB 
and AFQT scores as shown in Table 10. Again neither correla- 
tion with AFQT was significantly different from zero, and 
significant correlations appeared only for nonwhites on the 
GI, GS, and MC subtests. The difference between groups be- 
comes more significant in this instance because the sample 
size ls smaller for the minority group. 

When males and females were considered separately (Table 
11), none of the male correlations reached statistical signifi- 
cance at the .05 level, but the female IPR scores correlated 
significantly with three ASVAB subtests and the AFQT and 
approached significance (p < .06) on two additional subtests. 

In an attempt to investigate the difference in mean IPR 
and AFQT scores between these various groups, a series of 
significance tests of the difference of two means was carried 
out using Student's t test for independent samples. Means for 
IPR and AFQT were compared between white and nonwhite, male 
and female, and high AFQT 50% and low AFQT 50%. The results 


are shown in Table 12. Significant differences on the IPR 
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vere found only for the ethnic comparison, but the AFQT was 
significantly different for the AFQT grouping and the ethnic 
comparison. The former is trivial, of course, since the 
groups were divided on AFQT. 

Analysis was continued to examine the relationship of 
the paper-and-pencil test variables with the Arima-Young 
test while controlling for sex, ethnic affiliation and school. 
Multiple regression with forward stepwise inclusion was used 
to determine which variables were Significant predictors of 
the dependent variable IPR score. The stepwise inclusion 
process was stopped when the adjusted r” vas a maximum. 
Dummy variables were assigned to represent ethnic (white - 
nonwhite), sex, and school variables. Since GI had been most 
highly correlated in earlier analysis, interaction variables 
were created using GI and the demographic variables to see if 
GI predicted equally for all groups.  AFQT was also used in 
creating interactive variables because of its variance between 
groups and because of its importance in the service selection 
process. Table 13 lists the variables used in the regression 
analysis. Results of the ensuing regression appear in Table 14. 

It can be seen that only two variables are of significant 
use in predicting IPR scores. GI serves as a predictor for 
all subjects. AFQT is an aid in predicting IPR only for whites. 

There iS a possibility that scoring the learning test 
might be improved by weighting accuracy more heavily. The 


computed IPR was a simple division of correct answers by 
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TABLE 13 


VARIABLES USED IN MULTIPLE REGRESSION 


GI Score on GI Subtest 

EI Score on El Subtest 

WK Score on WK Subtest 

MK Score on MK Subtest 

GS Score on GS Subtest 

NO Score on NO Subtest 

AR Score on AR Subtest 

SÅ Score on SI Subtest 

AD Score of AD Subtest 

SP Score on SP Subtest 

MC Score on MC Subtest 

AL Score on AI Subtest 
AFQT Composite Score 

D1 1 1f Seaside High School; O otherwise 
D2 1 if Alisal High School; O otherwise 
D3 1 if white; O otherwise 
D4 1 if Male; O if female 
GID1 GI x D1 

GID2 GI x D2 

GID3 GI x D3 

GID4 GI x D4 

AFD1 AFQT x D1 

AFD2 AFQT x D2 

AFD3 AFQT x D3 

AFD4 AFQT x D4 
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TABLE 14 


REGRESSION ÓF IPR ON MULTIPLE VARIABLES 





Multiple r = .489 r” =" 7239 
Adjusted r= .189 
oard error = 215.550 
Analysis of Variance df SS MS F Sig 
Regression 4  877744.55 219436.14 4.7 .005 
Residual 60 2787705.45 46461.76 
Variables in Equation B Beta Std Error B F Sig 
ör 2822 из 11.83 0a os 
AFD3 3.48 . 33 məzə 7.33 do 
GID1 -12.49 -.21 6.91 3827" Nas: 
EI -8.67 -.18 051 1.98 N.S. 


constant 384.96 





Note. Wilkinson (1979) discusses some of the weaknesses in 
determining the significance of r in stepwise regres- 
sion. In his tables, determined by Monte Carlo Sim- 
ulation of the distribution of r2, the above results 
would not be significant at the .05 level unless r 
was above .31. 
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mall time (multiplied by 1000 for manipulation). Although 
it was not obvious in the experiment, a subject could score 

a high IPR simply by going very quickly and guessing exten- 
sively. A simple means of alleviating this potential problem 
is to introduce a penalty into the scoring. Subtracting the 
number missed from the number correct prior to dividing by 
total time would achieve the desired result. This formula 
was computed for the subjects involved in the experiment and 
found to be fairly consistent with IFR scores computed the 
original way. The two scoring methods' Pearson product moment 
correlation coefficient equalled .89, which exceeds the .01 
level of significance. From Table 15 it can be seen that 
accuracy becomes more important in the second scoring method 
and time ceases to have a significant effect on score. It 
can also be seen that the people who made the highest number 
of correct responses tended to be the ones who also worked 
the fastest. 

A further technique in scoring may have been a still better 
discriminator of ability in this test. As was attempted in 
the early stages of the experiment, a criterion of a certain 
number of correct responses in a row could be used to determine 
test completion. A score could then be derived from either 
time-to-completion or total frames exposed or some combination 
of the two. A major advantage of this scoring method would 
be that the quick learner would not be penalized by having to 


process ten (or however many) trials. There would have to 
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PARDEE 15 


CORRELATIONS OF TEST SCORES WITH TIME AND NUMBER CORRECT 


ADJSCR No. Correct Time 
IPR ‚89 , 37%% -.26* 
ADJSCR . 502 -.19 
No. Correct m dd 


Notes. 1.) IPR = Information Processing Rate = AA 


25775 015504 Score - Ma: corres: - No. Incorrect 


0, susfycant at .05 level. 
sasSignificant at .01 level. 
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be some upper limit of time or frames to preclude test 
marathons, however. 

Internal reliability of the test was measured using a 
split-halves technique described by Bruning and Klintz (1968). 
Information had been taken on 35 of the subjects as to the 
order of correct and. incorrect responses. Odd and even frames 
were split and Pearson's product-moment correlation was cal- 
culated between the two halves. After correction to reflect 
actual test length, a reliability coefficient of .77 was 
obtained. This compares well with a reliability coefficient 


of .84 obtained by Young using a similar method. 
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"Pile DISCUSSTON OF RESULTS 


It is apparent that the Arima-Young test does indeed 
measure some ability other than that measured by the more 
traditional ASVAB. The correlation of .26 between the IPR and 
AFQT scores shows that there is very little similarity between 
the two. 

An individual's AFQT score, which at present determines 
mental qualification for military service, is more an indica- 
tion of knowledge, facts, and processes absorbed by him or her 
up to the time of test administration. No matter what cogni- 
tive processes are involved during the test itself, learned 
skills are required to perform well. If an individual, for 
any reason, does not have these skills, he or she will not 
score well on the ASVAB. 

The Arima-Young test apparently measures a more basic 
quality. Learning was shown to have taken place in the 
original project (Young, 1975; Arima, 1978), and an individual's 
ability to learn could be differentiated by an information 
processing rate (IPR). Their test appears to involve more 
Specifically cognitive factors such as Memory Span, Visual 
Memory, and Perceptual Speed which are discussed by Harman 
(1975). The low correlations in Table 8 between IPR and ASVAB 
subtests indicate a difference in quality measured. 

Since the AFQT score was only weakly associated with the 


IPR score, there should then be some individuals who would 


Gil 





not have been accepted into the military based on ASVAB re- 
sults vho nevertheless displayed above-average learning 
ability as measured by the Arima-Young test. In fact there 
were four subjects in the present sample whose raw AFQT scores 
were below the required mark of 28 (for the Navy) but who, 
nevertheless, received above-average IPR scores of 607. Three 
of these individuals were nonwhite. 

It can also be seen (Figure 4) that there was another 
group which did quite well on the AFQT yet scored below 
average on the IPR. It is hypothesized that these may rep- 
resent people who make up for a lack of superior ability by 
hard studying -- the so-called overachievers. Hard study 
may be effective for paper-and-pencil tests covering tradition- 
al subjects; the Arima-Young test does not reward such 
behavior. 

The point is not that the Arima-Young test is better than 
the ASVAB for service selection purposes, only that it may 
measure a more specific, basic ability which is less depend- 
ent on previous schooling. There must obviously still be 
some sort of selection device such as the ASVAB. 

The military carries on much of its instruction for 
various specialties in traditional classroom environments with 
success heavily dependent on previously developed classroom 
skills. It would not be practical for the services to attempt 
to bring all personnel up to the point of verbal and mathemati- 


cal expertise which are assumed to be held by students entering 
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some of these schools. Therefore some form of testing is re- 
quired to select those vho have achieved the skills necessary 
to do well in continued classroom training. The ASVAB is of 
some use in this respect (Fischl, et al. 1977). 

A test of learning ability becomes important for those jobs 
where less traditional forms of instruction are or could be 
used, those jobs where one learns by watching and doing.  On- 
the-job training requires less of the formal, school skills 
while still requiring an ability to absorb new knowledge. 

A test such as the Arima-Young test could identify, from 
those doing poorly on the ASVAB, the people vho still have 
high potential to learn on the job if accepted into the mili- 
tary. Its potential is as an auxiliary selection tool in a 
time of restricted supply. 

There are certainly many other measures which could be 
taken to insure an adequate enlistment rate into the armed 
forces. Increasing retention, lowering physical standards, 
admitting more women, becoming more capital intensive, or 
reinstituting the draft would all accomplish the same thing, 
all with a different group of people. It is not the goal 
here to identify the best alternatives. 

For the present sample the Arima-Young test was not culture 
free in the sense that whites and nonwhites did not score 
equally. Regressions analysis brought out that there were 
some differences in predictors between the two ethnic groups 


and that the AFQT was a predictor of the IPR score for only 
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the white subjects. This result, vhich is consistent vith 

ME esulis of the original study (Arima, 1978; Young, 1975), 
permits the use of a single equation to predict the IPR scores 
of both groups and is culture-free in this respect. 

It may be that it is impossible to devise a test of any 
kind useful for service selection purposes that does not have 
an adverse cultural impact. Why performance on the present 
test was not equal for both ethnic groups can only be conjec- 
tured. The test was still a test, and those more experienced 
with tests may still have an advantage. Perceptions of author- 
ity, if different for each group, may have an affect on the 
test results. The race of the test administrator and the 
test environment may be influential factors. The language 
of the instructions, the words and phrases used, could give 
an edge to one group. If there is indeed an ethnic difference 
in IPR scores for the Arima-Young test that transcends the 
present sample for high school students, it would be interest- 
ing to follow age groups down through the primary grades to 
determine if this difference is inherent or if it is in some 
way developed or learned. 

In order to judge its final potential, the Arima-Young 
test must first be submitted to a test of its predictive 
validity against some final criterion. The immediate problem 
of course is that of finding any reliable and valid criterion 
of job performance. Supervisor ratings could immediately 


bring race back as an issue; there would be no final school 
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grades, and grades are not really a measure Of performance 

on the job; advancement is based on paper-and-pencil tests, 
something the subject has already proven he does poorly. Per- 
haps one must settle as Lockman (1975) did on a simple measure 
of survival. A person is successful lf he or she is still on 
the job some time in the future. If, somehow, the test proves 
to be a valid predictor of job success, then any ethnic dif- 
ference in scores could still be the basis for adverse impact 


if it is used indiscriminately for selection purposes. 
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IX. CONCLUSIONS 


The following conclusions may be drawn: 

1. The Arima-Young test can in fact identify, from those 
who score poorly on the AFQT, a group of individuals who have 
high learning ability, at least within the narrow parameters 
of the present test. 

2. Whites scored higher than nonwhites on the IPR test. 

A final determination of culture-fairness must await valida- 
tion of test results with some measure of success on the job 
as a criterion. 

3. Study of ethnic results for various age groups starting 
at an early age may give further insight as to the reasons for 
ethnic differences in score at the high school level. 

4. A more useful score may be derived by subtracting in- 
correct answers from correct ones prior to computing an Infor- 


mation Processing Rate (IPR). 
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APPENDIX A 
TEST INSTRUCTIONS 


The test you will soon see takes less than five minutes. 
There is no passing or failing score because the test is being 
used for research purposes only. 

You will be seated in front of a machine which will show 
you two figures at a time. The figures were drawn by computer 
and are not supposed to represent any physlcal objects. For 
each pair of figures I have picked one to be the right answer 
and one to be the wrong answer. You will be shown a number of 
pairs of these figures. For each pair I would like you to 
try to pick the correct figure by pressing the clear panel 
directly over that figure. If the machine moves to the next 
pair, you know you have picked the correct figure. If nothing 
happens, you should pick the other figure by pressing the clear 
panel over it. Pushing both panels at the same time causes 
an error. 

You will see the same pairs of figures many times. The 
first time you see a Pair you can only guess which of the two 
figures I have chosen as correct. After you have guessed, try 
to remember which figure was correct so that you can pick that 
one the next time you see it. The same two figures will al- 
ways be together, and the correct figure will remain the 
same. At times I have flip-flopped the pairs so that the 


correct answer may be on either tre left or the right side. 


DIA 





b = © 
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Although a timer is being used, this is not a timed test. 
Just work as quickly as you can without rushing. From time 
to time a buzzer may sound. Ignore it; it is simply built 
into the system and does not affect you. 


Here is an example of what you will see: 
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SUBJECT 


APPENDIX B 


INDIVIDUAL IPR AND AFQT SCORES 


ETHNIC IPR AFQT 
W 588 49 
W 840 6 5 
W 882 46 
W 534 45 
W 475 41 
W 223 45 
W 506 46 
W 446 22 
W 400 42 
N 261 22 
W 833 40 
N 321 19 
W 1090 30 
W 469 43 
W 466 39 
N 361 32 
W 057 LÅ 
W 511 62 
W 379 58 
W 412 35 
N 464 49 
W 448 DO 
W 764 52 
W 779 24 
N 583 po 
W 829 57 
W 584 29 
W 1020 40 
W 318 54 
N 431 55 
N 568 36 
W 644 İLİ, 
W 398 42 
W 652 48 
N 442 43 
W 492 43 
N 706 26 
N 822 63 
W 988 52 
N sy, LO 
W 36 5 28 
N 4 56 24 


DO 
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APPENDIX B 
(continued) 


ETHNIC IPR 


AZ ĄRKĄZAĄ A RZ Z SĄ 4 Z, 5, Z, 5 5 Z, ZE, Z, 
Ui 
E 
O 


White 
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